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Abstract. We survey some connections between topological dynamics, semigroups 
of ultrafilters, and combinatorics. As an application, we give a proof, based on ideas 
, of Bergelson and Hindman, of the Hales-Jewett partition theorem. 

■3 

Furstenberg and his co-workers have shown [15, 16, 17] how to deduce combina- 
torial consequences from theorems about topological dynamics in compact metric 
K*" ! spaces. Bergelson and Hindman [4] applied similar methods in non-metrizable 
^ I spaces, particularly the Stone-Cech compactification /3N of the discrete space of 
CN ' natural numbers. This approach and related ideas of Carlson [11] lead to par- 
ticularly simple formulations since many of the basic concepts of dynamics, when 



■ applied to can be expressed in terms of a semigroup operation on the nat- 
^ ' ural extension of addition on N. The semigroup /3N can also substitute, in many 
contexts, for the enveloping semigroups ([14]) traditionally used in topological dy- 
J3 ■ namics. Further simplifications and applications of these ideas were developed in 
[3]. 

The purpose of this paper is to survey some of these developments. In contrast to 
^ . most surveys, however, we include some detailed proofs, in order to emphasize their 
simplicity. In the first three sections, we develop the necessary theory of dynamics 
and the equivalent semigroup structure in (3N. In the fourth section, we apply the 
theory to present proofs of Hindman's partition theorem for finite sums and of the 
Hales-Jewett theorem about homogeneous combinatorial lines in cubes. A final 
section (omitted for lack of time in the talk on which this paper is based) compares 
the ultrafilters discussed in the earlier sections with other ultrafilters traditionally 
related to combinatorics, for example selective ultrafilters. 



1. Ultrafilters 

Throughout most of this paper we are concerned with ultrafilters on the set N 
of natural numbers. These are usually defined by some version of the following 
set-theoretic definition, in which we have included some redundant clauses for ease 
of future reference. 
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Definition 1. An ultraRlter on N is a family U of subsets of N such that 

(1) If X C y and X e W then Y eU. 

(2) UX,Y eU then XDY eU. 

(3) ^ 

(4) neU. 

(5) For any X C N, either XeWorN-XeW. 

(6) liXUY eU then either X e W or y e W. 

The first four of these clauses define filters. 

The implications in (2) and (6) are reversible, by (1). 

To each o e N is associated a principal or trivial ultrafilter, namely d = {X C N 
I a e X}. In many contexts, we identify d with a. 

The following alternative definition expresses the usual way of viewing ultrafilters 
topologically. 

Definition 2. An ultrafilter on N is a point in the Stone-Cech compactification 
/3N of the discrete space N. 

The two definitions are equivalent in the following sense. For any point p G /3N, 
the family of subsets of N whose closures in (3N contain p satisfies Definition 1. 
Conversely, if U is as in Definition 1, then the closures in /5N of the sets in U have 
exactly one point p in common. And the constructions described in the preceding 
two sentences are inverse to each other. 

Although the two definitions of ultrafilters above are the most familiar ones, 
two other, equivalent definitions will be more useful for our purposes. The first 
of these uses the notion of a quantifier Q over N. This is an operation which 
applies to a formula (fi{n) with a free variable ranging over N and produces a new 
formula (Qn) f{n) in which n is no longer free; it is required that replacing f{n) 
by an equivalent formula il){n) yields an equivalent result (Qn) ilj{n). Formally, a 
quantifier can be identified with the set of those X C N for which {Qn) n e X is 
true. Under this identification, the following definition amounts to Definition 1. 

Definition 3. An ultrafilter on N is a quantifier U over N that respects the propo- 
sitional connectives in the sense that the following equivalences hold for all formulas 
(fi{n) and il^{n) 

(1) (Un) ip{n) A (Un) tpin) (Un) {ip{n) A tpin)) 

(2) (Un) </7(n) V (Un) ij{n) [Un) {<fi{n) V ^{n)) 

(3) -'{Un) (p{n) ■^=^ {Un) -'(p{n) 

IfU is an ultrafilter in the sense of Definition 1, then the corresponding quantifier 
{Un), usually read "for Z^-almost all n," is defined by 

{Un) (f{n) <^ {n e N I (fi{n)} e U; 

conversely, from a quantifier as in Definition 3 we can define 

U = {X CN\ {Un) n e X}. 

As all propositional connectives (of any number of arguments) can be expressed 
in terms of and A, they are all respected by ultrafilter quantifiers. 

Finally, we give another topological definition, which will provide a connection 
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Definition 4. An ultraGlter on N is a uniform operation on sequences in compact 
Hausdorff spaces. That is, it is an operator assigning to every sequence {xn)neN in 
every compact Hausdorff space X a point ZY-lim„ Xn & X subject to the requirement 
that, if / : X — > y is a continuous map to another compact Hausdorff space, then 

/(W-Hm„ Xn) = U-liuin f{Xn)- 

The easiest way to connect this definition with the previous ones is to notice that 
a sequence (xn) in X is a (continuous) function x from the discrete space N into X] 
if X is a compact Hausdorff space, then this map extends uniquely to x : /?N X, 
and so each p G j3N yields a point x{p) G X. Uniformity is easy to check, so 
an ultrafilter in the sense of Definition 2 yields one in the sense of Definition 4. 
Conversely, an operation as in Definition 4 can be applied to the sequence in /3N 
whose n^^ term is n, yielding a point p G (3N, and these constructions are inverse 
to each other. 

One can verify that W-lim„ Xn is the unique point in X such that every neigh- 
borhood G of it satisfies (Un) Xn G G. In particular, it follows that W-lim^ x^ is a 
limit point or a member of the sequence (xn)- Thus, a non-trivial ultrafilter can be 
regarded as a systematic way of passing to a limit point of any sequence. 

The trivial ultrafilter a corresponds in Definition 2 to the point a G N C /?N, in 
Definition 3 to the "quantifier" that just substitutes a for the quantified variable, 
and in Definition 4 to the operation that picks out the a^^ term from sequences. 

The set (3N of all ultrafilters on N admits a binary operation -|-, extending or- 
dinary addition on N (see for example [12, 23]). In the context of Definition 4, it 
amounts to an iteration of limit operations: 

(U + VyiimpXp = W-lim^(V-lim„a;^+„). 

Translating this into the language of quantifiers, one again finds an iteration: 

((U + V)p)ipip) <^ (Um)(Vn)(p{m + n). 

The equivalent characterizations in terms of Definitions 1 and 2 are more compli- 
cated, at least on first sight. Definition 1 leads to 

U + V = {X CN\{m\{n\m + neX}eV}eU}. 

And for Definition 2 we have the following description of addition. Start with 
ordinary addition -|- : N x N ^ N. Extend it by continuity to -|- : N x /3N PN, 
fixing the first argument in N and requiring continuity in the second. Then, fixing 
the second argument in PN and requiring continuity in the first, obtain an extension 

+ : /3N X /3N ^ (3N. 

Notice that the operation -|- on PN is a continuous function of the left summand 
for any fixed value in PN of the right summand, but it is not a continuous function 
of the right summand for a fixed left summand unless the latter is in N (see [23, 
Section 10]). I refer to continuity in the left argument as left-continuity, and I 
therefore call PN a left topological semigroup. (Caution: Some authors use "right" 
instead of "left" because the right translations are continuous, and some authors 
define W -|- V to be what I would call V + U; authors who disagree with me on both 
points therefore say "left," just as I do, though they mean the opposite.) 

The addition operation on PN is associative (most easily checked using the quan- 
tifier description of +); it is commutative as long as one of the summands is in N, 

J- ;„ „ 1 — J„j-„;l„ Too c — .j-; — inn 
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2. Dynamics 

Topological dynamics is concerned with the behavior of iterations of a continuous 
map T from a space X into itself. (Actually, it is considerably more general [14], 
but the preceding description covers what will be relevant here.) For the purposes 
of this paper, a dynamical system consists of a compact Hausdorff space X and a 
continuous function T : X ^ X. We write T"^ for the n*"^ iterate T o T o ■ ■ ■ o T 
of T. To study the limiting behavior of these iterates for large n, we define (as in 
[24]) for each ultrafilter W on N 

T^{x)=U-\imnT''{x). 

Regarded as a function of W G (3N, for fixed x E X , this is the continuous extension 
to PN of the function N ^ X : n ^ T"(a;). It follows that {T^ix) \ U G /5N} is the 
closure in X of the forward orbit {T"^(x) | n G N} of the point x. 

But regarded as a function of x for fixed W, T^{x) need not be continuous unless 
U is principal. If U is the principal ultrafilter n, then — T", so no confusion 
will be caused by identifying n with n in this context. 

The notion of iteration with respect to an ultrafilter, T^, connects nicely with 
the addition operation on ultrafilters in that 

T"(T^(x)) = T"+^(x). 

Indeed, we have 

T"+^(x) = (W + V)-limpTP(x) 

= U-limm V-Um„ T"^+'^(x) = W-hm^ V-hm^ T"^(T"(x)) 
= W-lim^ T'" ( V-hm„ (x) ) (as T"^ is continuous) 
= T^{T^{x)). 

We next introduce some concepts from topological dynamics, i.e., concepts about 
the behavior of T'^{x) for large n. In each definition, it is assumed that {X,T) is 
a dynamical system. More information about these concepts can be found in [14, 
15]. 

Definition. A point x E X is recurrent if, for each neighborhood G of x, infinitely 
many n G N satisfy T'^{x) G G. It is uniformly recurrent if, for each neighborhood 
G of X, there is M G N such that Vn 3k < MT'^+^{x) G G. 

Thus, recurrence means that, under the iteration of T, the point x returns to 
each of its neighborhoods infinitely often. Uniform recurrence bounds how long the 
sequence of iterates can stay out of any given neighborhood; there is M depending 
on G such that of every M consecutive iterates at least one is in G. 

Definition. Two points x,y e X are proximal if, for every neighborhood G of the 
diagonal in X x X, infinitely many n G N satisfy (T^(a;), T^(y)) G G. 

Proximality is usually defined in the context of metric spaces by requiring that, 
for every positive s, infinitely many n have the distance between T^{x) and T'^{y) 
smaller than e. This definition clearly makes use not of the full metric structure 
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space. A compact Hausdorff space has a unique uniform structure, and the defini- 
tion we gave for compact Hausdorff spaces is just the speciahzation to this case of 
the general concept in uniform spaces. 

The dynamical concepts just defined can be elegantly expressed in terms of 
ultrafilter iterations, as follows. 

Theorem 1. Let {X,T) be a dynamical system. 

(1) A point X E X is recurrent if and only if T^{x) = x for some non-trivial 
ultrafilter U on N, if and only if T^{x) — x for some U ^ Q. 

(2) A point X & X is uniformly recurrent if and only if for every ultrafilter V 
on N there is an ultrafilter U onN with T^{T^{x)) = x 

(3) Two points x,y E X are proximal if and only if there is an ultrafilter U on 
N with T^{x) = T^{y), if and only if there is a non-trivial ultrafilter U on 
N with T^{x)^T^{y). 

Proof. (1) By definition, recurrence means that x is a limit point of the sequence 
(T"(x)). That implies that x is in the closure {T^{x) | W G /3N - {0}} of {T^(x) | 
n e N - {0}}. This in turn implies that x e {T^{x) | W G /3N - N}, i.e., that we 
can take U non-trivial. Indeed, if U were trivial, say U — h ^ 0, then T"(a;) = x, 
so T'^^{x) = X for all A;, and therefore, if we take U' to be a non-principal ultrafilter 
containing the set of multiples of n, then [x) = x also. Finally, x e {T^{x) \ 
U e /3N — M} implies recurrence, since every T^(x) with non- principal W is a limit 
point of {T"(x) I n e N}. 

(2) Assume first that x is uniformly recurrent, and let an ultrafilter V be given. 
Temporarily fix a closed neighborhood G of x, and let M be as in the definition of 
uniform recurrence for this neighborhood. So Vn3/c < MT"+'^(a;) e G. As only 
finitely many /c's occur and as V is an ultrafilter, the same k must work for V- 
almost all n. Fix this k, so we have {Vn)T'^+^{x) e G. Equivalently, (Vn)T'^(x) G 
T-^{G). As T-^(G) is closed, T^{x) G T-^(G), and so T^{T^{x)) G G. Now un- 
fix G, and remember that, as X is a compact Hausdorff space, every neighborhood 
of X includes a closed neighborhood. So we have shown that, for every neighborhood 
G of X, the set 

Yg = {/c e N I T''{T^{x)) g G} 

is non-empty. Clearly, HIgj = ^0111021 ^ ^ ranges over the neighborhoods 
of X, the sets Yc generate a filter. Extend it to an ultrafilter U. Then we have, for 
each neighborhood G of x, (Uk) T^{T^{x)) G G, so r"(T^(x)) = x, as desired. 

Conversely, suppose x is not uniformly recurrent, and fix an open neighborhood 
G such that no M satisfies the definition of uniform recurrence. That is, for all 
M G N, the set 

= {n G N I (V/c < M) T''+^{x) ^ G} 

is nonempty. These sets Ym generate a filter, as they form a chain, so there is an 
ultrafilter V containing all of them. For every /c G N we have, since Yit+i ^ V, 
{Vn)T''^^{x) i G] so (Vn)T^(x) i T'^'iG); so, as T'^iG) is open, T^{x) i 
T~^{G); so T'^{T^{x)) ^ G. As k is arbitrary, it follows that, for any ultrafilter U, 
T^{T^{x)) i G, as desired. 

(3) Suppose T^{x) = T^{y). If W is principal, then proximality follows trivially, 
so suppose U is non-principal. To prove that x and y are proximal, let G be any 

:„l^^ 1 ] „f 1 ci;„„„ lrrUl„\ rriUl...\\ ii i;™ lrrnl^\ ^^n^',.^^ ;„ — 
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the diagonal by assumption, there must be infinitely many n (in fact ZY-almost all 
n) such that (T^(a;), T"(y)) e G, as required. 

Conversely, suppose x and y are proximal. So, as G ranges over neighborhoods 
of the diagonal, the sets 

YG^{neN\iT-{x),T^{y))eG} 

are non-empty, and they generate a filter (because Yq^ n 1^2 = ^01002)^ which 
wc extend to an ultrafiltcr U. Then wc have, for all closed neighborhoods G of the 
diagonal, (Un) (T"(x), T'^(y)) G G and so {T^ {x),T^ (y)) G G. But the intersection 
of all closed neighborhoods of the diagonal is just the diagonal, so we conclude that 
T^{x)=T^{y). □ 

We close this section by pointing out a simpler connection between dynamical 
systems and ultrafilters: ultrafilters provide the universal example of a dynamical 
system. The compact Hausdorff space /9N with the shift map S : /9N — > /?N : U 1— > 
1 + W is a dynamical system and enjoys the following universal property. If {X, T) 
is any dynamical system and x is any point in X, then there is a unique continuous 
map / : /3N — * X such that f o S = T o f and /(O) = x, namely the map defined 
by f{U) = T"(x). Thus, {(3N, S) may be regarded as the free dynamical system on 
one generator. 

Henceforth, when we refer to /?N as a dynamical system, we mean (/5N, S). 

3. Dynamics = Algebra 

The addition operation defined for /3N in Section 1 is just the ultrafilter iteration 
of the shift map, i.e., of the universal dynamical system. Indeed, we have, for any 
ultrafilters U and V on N 

^"(V) = U-liuin S''{V) = W-Um^(n + V) = U + V, 

where at the last step we used that ZY-lim commutes with continuous maps, like 
addition as a function of its left argument, and that W-lim„ n = U. 

Of course, this equivalence between iteration in (/3N, 5") and addition allows us 
to reformulate Theorem 1 algebraically. We do so in the following theorem, adding 
some more reformulations in terms of subsemigroups (i.e., non-empty subsets closed 
under addition) and ideals in the semigroup (/?N, +). A left ideal is a non-empty set 
/ C /3N such that \iU e I and V G /?N then V + U e I; right ideals and two-sided 
ideals are defined analogously. 

Theorem 2. 

(1) An ultrafilter U is recurrent in /5N if and only ifV + U = U for some V 7^ 0. 

(2) An ultrafilter U is uniformly recurrent in f3N if and only if, for each V, there 
is W with W + V +U = U, if and only ifU belongs to a minimal (closed) 
left ideal in 

(3) Two ultrafilters Ui and U2 are proximal in PN if and only if there is an 
ultrafilter V such that V + Ui = V + W2 . 

(4) An ultrafilter U generates a minimal closed subsemigroup of PN if and only 
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Proof. Parts (1), (3), and the first part of (2) are immediate consequences of the 
corresponding parts of Theorem 1 and the fact that S^iy) =U + V. 

To finish the proof of (2), notice first that every ultrafilter U generates a left 
ideal, namely (3N + V( = {V + U\ Vg (3N}. It follows that a minimal left ideal, 
being the ideal generated by any of its elements U, is closed, for it is the image of 
the compact space (3N under the continuous map adding U on the right. That is 
why "closed" is parenthesized in (2); putting it in or leaving it out doesn't aff'ect 
the statement. Now to say that a particular ultrafilter is in a minimal left ideal 
is to say that the left ideal /3N + U that it generates is minimal or, equivalently, 
is generated by each of its elements. That is, for every ultrafilter V, the ideal 
PN + V + U generated hjV + U must be all of pN + U. Equivalently, /?N + V + W 
must contain the generator U of /?N + W. But that means that, for every V, we can 
express U as W + V + W by suitably choosing W. This completes the proof of (2). 

(4), which is included in the theorem because of its analogy to (2), is trivial in 
one direction, as {U} is a closed subsemigroup if U is idempotent. To prove the 
non-trivial direction (due, as far as I know, to Ellis [14]), let C be a minimal closed 
subsemigroup of /?N and let U E C. Then C+U is also closed (being the image of the 
compact set C under a continuous map) and a subsemigroup of C, so by minimality 
it equals C. In particular it contains U. So the set D = {V&C\V + V{ = U} 
is nonempty. It is closed (being the pre-image of {U} under a continuous map) 
and also a subsemigroup of C, so it equals C and therefore contains U. That is, 
U + U = U, and the proof is complete. (It follows, of course, by minimality, that 
C = {U}.) □ 

The information in Theorem 2 about ideals and subsemigroups can be used to 
give quick existence proofs for the corresponding sorts of ultrafilters. 

Corollary. There exist uniformly recurrent ultrafilters. There exist non-trivial 
idempotent ultrafilters. 

Proof. The intersection of a chain of closed subsemigroups of /9N is again a closed 
subsemigroup; it is non-empty by compactness, and it is obviously closed topo- 
logically and closed under addition. By Zorn's Lemma, there are minimal closed 
subsemigroups of /9N— N. Their elements arc idempotent by (4) of the theorem. The 
same argument applied to closed left ideals yields uniformly recurrent points. □ 

The concepts characterized in Theorem 2 are related to each other as follows. 

Theorem 3. Each of the following statements about an ultrafilter U and the dy- 
namical system /3N implies the next. 

(1) U is uniformly recurrent and proximal to 0. 

(2) U is idempotent. 

(3) U is recurrent and proximal to 0. 

Proof. (1) =^ (2) Let U be uniformly recurrent and proximal to 0. By Theo- 
rem 2(3), fix V with V + W = V + = V. By Theorem 2(2), fix W with 

>V + V + W = W. 

Combining these two equations, we get W = W + V, and substituting this into the 
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(2) =^ (3) If U is idempotent, then the requirement V + U = U for recurrence 
and the requirement V + W = V for proximahty to (see Theorem 2) are satisfied 
by taking V = W. □ 

The preceding results connect the algebraic properties of /?N with its dynamical 
properties, but in fact, thanks to the universality of (3N among dynamical sys- 
tems, we easily get connections between the algebra of /3N and arbitrary dynamical 
systems. 

Theorem 4. Let {X,T) be a dynamical system and let x E X. IfU is (uniformly) 
recurrent in /3N then T^{x) is (uniformly) recurrent in X . If Ui andU2 are prox- 
imal in /3N then T^^{x) and T^^{x) are proximal in X. 

Proof. Each part is proved by combining the corresponding parts of Theorems 1 
and 2 with the fact that T^+^ = o T^. 

Suppose U is recurrent in /?N. So by Theorem 2(1) there is V with V + U = U. 
Then T^(T"(x)) = T^+^{x) = T"(x), so T"(x) is recurrent by Theorem 1(1). 

Suppose U is uniformly recurrent. By Theorem 2(2), for every V there is W with 
W + V + W = W and therefore T^(T^(T"(x))) = T^{x). By Theorem 1(2), T"(x) 
is uniformly recurrent. 

Finally, suppose Ui and U2 are proximal. By Theorem 2(3), there is V with 
V + Wi = V + ^2- Then T^{T^^{x)) = T^{T^^{x)). By Theorem 1(3), T^^{x) and 
(^x) are proximal. □ 

As an application of these connections between dynamics and algebra, we give a 
short proof of the Auslander-EUis Theorem [15]. 

Theorem 5. Let iX,T) be a dynamical system. For each x & X, there exists a 

uniformly recurrent y proximal to x. 

Proof. By the corollary of Theorem 2, there exists a uniformly recurrent V G (3N. It 
follows immediately that every ultrafilter of the form W-|- V is uniformly recurrent. 
The set /3N + V of such ultrafilters is a closed subsemigroup of /3N. By Zorn's 
Lemma, it includes a minimal closed subsemigroup. By Theorem 2(2), there is an 
idempotent U G /3N + V. Then U, being uniformly recurrent and idempotent, is 
also proximal to by Theorem 3. 

Now for X, T, and x as in the theorem, let y — T^{x). Then, by Theorem 4, y 
is uniformly recurrent and proximal to T^{x) = x. □ 

The property of ultrafilters, "uniformly recurrent and proximal to 0," which 
played a key role in the proof of Theorem 5, has alternative algebraic descriptions 
that will be useful later. To introduce them, we first define a partial ordering of 
the idempotent ultrafilters by 

U<V ^ U + V = V+U ^U. 

This definition and parts of the next theorem are from [4]. When we refer to an 
idempotent ultrafilter as minimal, we mean with respect to this ordering. 

Theorem 6. The following three assertions are equivalent, for any lA G /?N. 

(1) U is uniformly recurrent and proximal to 0. 

(2) U is idempotent and belongs to some minimal left ideal of 
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Furthermore, these equivalent conditions imply that tl belongs to every two-sided 
ideal of PN. Finally, every idempotent ultrafilter U is > a minimal idempotent. 

Proof. 

The equivalence of (1) and (2) is immediate from Theorem 2(2) and Theorem 3. 

To prove (2) =^ (3), assume (2), and suppose V is an idempotent < U. Since 
L( is uniformly recurrent by Theorem 2, choose W so that W + V + U =U, which 
reduces, in view oi V <U, to W + V — U. Using this, the idempotence of V, and 
again V <U, we compute 

V = U + V = W + V + V = W + V = U, 

so U is minimal. 

We next show that, if U is idempotent and / C /3N + U is a minimal left ideal, 
then there is an idempotent V < U m I. Since we already know that (2) implies 
(3), this gives the last sentence of the theorem; it will also be useful in establishing 
(3) =^ (2). So let such U and / be given. Being a closed subsemigroup of / 
contains an idempotent W. Being in (3N + U, this W satisfies W + U — W because 
U is idempotent. Let V = U + W. Then V belongs to the left ideal / because W 
does. From W + U = W and the idempotence of W and U, we infer 

V + W = W + >V + W = W + >V = V, 

U + V = U + U + W = U + W = V, 

and 

V + V = W + >V + W + >V = W + W + W = W + W = V, 

which mean that V <U, as desired. 

The proof of (3) =^ (2) is now easy. If is a minimal idempotent, apply Zorn's 
Lemma to get a minimal left ideal / C pN + as in the preceding paragraph, and 
let V be obtained as there. Being < U, this V must be equal to U by minimality. 
SoU el. 

Finally, we must prove that every ultrafilter satisfying (2) belongs to every two- 
sided ideal. In fact, that every minimal left ideal / is included in every two-sided 
ideal J. To see this, let U E I and V E J. Then / fl </ is non-empty because it 
contains V + U. So / fl J is a left ideal, and it must equal / because / is minimal. 
So / C J. □ 

4. Combinatorics 

In this section, we apply the results obtained above to give relatively easy proofs 
of some highly non-trivial combinatorial theorems. The first of these is Hindman's 
Theorem, first proved in [22]. A simpler proof was given by Baumgartner [1], but 
we shall give two yet simpler (given the preceding machinery) arguments, one due 
to Furstenberg [15] and the other to Galvin and Glazer [12, 19, 23]. 

Theorem 7. //N is partitioned into finitely many pieces, then there is an infinite 
H CN such that all finite sums of distinct members of H lie in the same piece. 

Furstenberg 's Proof. Let the given partition have k pieces, and regard it as a func- 

— T\T . 1 — j- „„j- t „j- v u„ ^-^ j- „f „ii .j-; t\t . 
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topologized by giving K the discrete topology and then giving X the product 
topology. Thus, K is a compact Hausdorff space, and the given partition is a point 
X e X. Let T : X ^ X he the shift map, defined by T{y){n) = y{n + 1); it is 
clearly continuous, so we have a dynamical system. By Theorem 5, let y G X be 
uniformly recurrent and proximal to x. We write out what these properties of y 
mean for our specific X and T. Uniform recurrence means that, given any n e N, 
there is A?" e N such that the initial segment (j/(0), y(l), . . . , y(n — 1)) of j/ recurs at 
least once in every segment {y{r), . . . , y{r -\- N — 1)) of y of length N. Proximality 
means that, given any N, there are infinitely many intervals of length N where x 
and y agree, {x{r), . . . , x{r + N — 1)) — {y{r), . . . , y{r + N — 1)). 

Let c — y{0). We intend to complete the proof by finding infinitely many natural 
numbers, all of whose finite sums are mapped to c by x. 

By uniform recurrence, find A^o such that c occurs at least once among every 
A^o consecutive terms in y. By proximality, find a place, beyond term 0, where A^o 
consecutive terms of x coincide with those of y and therefore contain a c. So we 
can fix /lo > with x{ho) = c. This ho will be the first member of our H. 

By uniform recurrence, find A^i such that among every A^i consecutive terms of y 
there are ho + 1 consecutive terms that coincide with y(0), . . . , y(/io); in particular, 
among every A^i consecutive terms, there are two terms a distance ho apart where 
y has the value c (the same as at and ho). By proximality, there are two places a 
distance ho apart where x has value c, say x{hi) = x{hi + ho) = c, with hi > ho- 
hi will be the next member of H. 

Repeating this process, we inductively choose hi so that, for all sums s of zero 
or more elements of {ho, . . . , /ii-i}, we have x{s + hi) = y{s + hi) = c. This is 
done by finding A^ such that every A^ consecutive terms of y contain a segment 
that coincides with the initial segment of y up to the largest s, and then finding a 
segment of length N beyond hi-i where x and y coincide. 

The set of all finite sums of distinct h^s is clearly included in the partition piece 
corresponding to c. □ 

Galvin's and Glazer's Proof. This proof uses ultrafilters directly rather than apply- 
ing them via Theorem 5 to dynamics on other spaces. Let U be any idcmpotcnt 
non-trivial ultrafilter on N (by the corollary to Theorem 2), and let C be the piece of 
the partition that is in U (by clause (6) of Definition 1). So we have {Un) n E C. As 
U is idempotent, we also have (cf. the quantifier form of the definition of -|- in /5N) 
(Un) (Uk) n + k e C. As ultrafilter quantifiers respect propositional connectives, 

(Un) [n eC A{Uk)n + ke C]. 

So we can fix ho with ho & C and (re-naming variables) (Un) ho + n & C. Using 
again that U is idempotent and respects connectives, we find 

(Un) [neC A (Uk) n + keCAho + neCA (Uk) ho + n + k e C]. 

So wc can fix hi having the four properties listed for n inside the brackets. In 
particular, hi & C and ho + hi G C. 

Repeating this process, we inductively choose hi so that, for all sums s of zero 
or more elements of {ho, ■ ■ ■ , /ii-i}, we have s + hi & C and (Un) s + hi + n E C. 
The latter property, when expanded by idempotence, ensures that it is possible to 
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as /ii+i.) Clearly, all finite sums of distinct members oi H = {hn | n e N} are in 
C. □ 

Notice that the Galvin-Glazer proof shows that the piece that contains the ho- 
mogeneous H can be taken to be any piece of the given partition that belongs to 
some idempotent ultrafilter. 

We turn next to an application of these ideas in the context of words over a finite 
alphabet, rather than natural numbers. We shall prove the Hales-Jewett Theorem 
[20, 19], but first we need some definitions and notational conventions. 

Let E be a finite set, which we call an alphabet, and let W be the set of words 
on E, i.e., the set of finite sequences of members of E. Let v be an object, called a 
variable, that is not in E; let A be the set of words on E U {v}; and \eiV = A — W . 
So V is the set of words on E U {v} in which v actually occurs; these are often called 
variable words over E. For each a e E, we define a function a : A ^ W sending 
each X ^ A to the result of substituting a for v in x; we call a{x) an instance of x. 
Notice that if x G then d(x) = x. 

With this notation, the Hales-Jewett Theorem [20] is as follows. 

Theorem 8. // W is partitioned into finitely many pieces, then there is an x & V 
whose instances all lie in the same piece of the partition. 

The proof is a special case of arguments from [3] . 

Proof. First, observe that A is a, semigroup under the operation of concatenation, 
that is a subsemigroup, that y is a two-sided ideal in A, and that each a is a 
homomorphism A W. The operation ^ can be extended to the Stone-Cech 
compactification jSA just as addition was extended to f3N. 

{U-Vx)<fi{x) ^ {Uy){Vz)<p{y-z). 

It is easy to verify that, in the compact left-topological semigroup f3W is a 
closed subsemigroup, PV is a closed, two-sided ideal, and the continuous extension 
of a, which we still call a : pA — > PW, is a homomorphism. 

The algebraic results about f3N proved earlier generalize easily to semigroups like 
PA and PW. We apply the analogs in this context of several parts of Theorem 6. In 
particular, there is a minimal idempotent W E pW. In this W is idempotent 
but not necessarily minimal. (In fact, we shall see in a moment that it is definitely 
not minimal.) There is a minimal idempotent V < W in PA. It belongs to every 
two-sided ideal, so V G PV. (In particular, V 7^ W.) 

For any a G E, since d is a homomorphism pA — > pW, we can infer from V < W 
that d(V) < d(W) = W (the last equality because a is the identity on W and hence 
on pW). By minimality of W in pW, it follows that a(V) = W. 

Now let W be partitioned into finitely many pieces, and let X be the piece that is 
in the ultrafilter W. For each a G E, we have a~^{X) G V because X eW = d(V). 
So riaex; ^~^(-^) non-empty. (In fact it is in V.) Any element x of this intersection 
clearly serves as the x required in the theorem. □ 

The proof actually establishes a stronger theorem, obtained by broadening the 
notion of "instance" to allow a specified, finite set of words in W (not merely 
single letters) as the a's being substituted for v. Unlike Theorem 8, this stronger 
form is non-trivial even in the case where E consists of just one letter; indeed this 
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Waerden's theorem is usually deduced from Theorem 8 for a 6-element alphabet by- 
using base b expansions of natural numbers; see [19].) 

5. P-POINTS 

In this section, we briefly discuss the connections between the ultrafilters dis- 
cussed earlier and other, perhaps more familiar (from [2, 3, 6, 8, 10, 13, 25, 26] for 
example), special ultrafilters. We begin with a pair of definitions. 

Definition. A non-principal ultrafilter W on N is selective if every function on N 
becomes one-to-one or constant when restricted to a suitable set in U. 

Definition. A non-principal ultrafilter on N is a P-point if every function on N 
becomes finite-to-one or constant when restricted to a suitable set in U. 

Both of these definitions have numerous equivalent forms. The versions above 
were chosen to make it obvious that all selective ultrafilters are P-points. The 
definition of P-point is just a combinatorial reformulation of the usual topological 
notion of P-point specialized to the space (3N: a point such that the intersection of 
any countably many neighborhoods is again a (not necessarily open) neighborhood. 

Selectivity also has a topological formulation, based on a theorem of Kunen [10] 
that characterizes selective ultrafilters U as those enjoying the following Ramsey 
property. If the set [N]^ of two-element subsets of N is partitioned into two pieces, 
then there is a set if G W all of whose two-element subsets lie in one piece. (The 
corresponding statement for [N]'^ holds for all finite k, and there are even infinitary 
generalizations; see [26].) If we identify [N]^ with the "above diagonal" subset 
{(a, 6) I a < b} of N^, then this Ramsey property says that [N]^ together with the 
sets HxH for H eU generate an ultrafilter on NxN. Let r : /3(NxN) ^ /3Nx/3N be 
the continuous extension of the inclusion map N x N ^ /3N x /3N. Then ultrafilters 
on N X N that contain H x H for all H E U are precisely those sent by r to 
{U,U). Thus (cf. [5, Section 10]), the Ramsey property is equivalent to saying 
that T~^{Ll,U) consists of exactly three points, namely the ultrafilter on N x N 
mentioned above, a symmetrical one "below diagonal," and an isomorphic copy of 
U concentrated on the diagonal. (Hindman [21] has shown that there are P-points 
U such that t~^{U,U) has the same cardinality, 2^ ", as (3N.) 

P-points also have a Ramsey-like property [2, Theorem 2.3]: If is a P-point 
and if [N]^ is partitioned into two pieces, then there is H eU and there is a function 
/ : N ^ N such that one piece of the partition contains all the two-element subsets 
{a, b} of H for which /(a) < b. 

Although P-points and selective ultrafilters, like the ultrafilters discussed in the 
previous sections, have interesting combinatorial and topological properties, they 
are quite different in several respects, of which we list three. 

First, since U+V = W-lim„(?i+V), while P-points are, in view of their topological 
description, never limit points of a countable set of other ultrafilters, it follows that 
no P-point can be of the form U + V. In particular, no P-point can be recurrent. 
So the family of P-points and, a fortiori, the subfamily of selective ultrafilters are 
disjoint from the families of ultrafilters studied in the preceding sections (recurrent, 
idempotent, etc.). 

Second, the ultrafilters in the preceding sections are proved to exist on the basis 
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of choice). In contrast, the existence of P-points or of selective ultrafilters is in- 
dependent of these axioms. More precisely, the continuum hypothesis (as well as 
weaker assumptions, like Martin's axiom) implies the existence of many selective 
ultrafilters and also many P-points that are not selective, but there are models of 
set theory with no selective ultrafilters [25] and even with no P-points [27, 28]. 

Finally, where Ramsey ultrafilters have t~^(U,U) as small as possible, namely 
of size 3, the following theorem shows that idempotent ultrafilters have it as large 
as possible. 

Theorem 9. IfU is an idempotent non-trivial ultrafilter on N, then there are 2^ ° 
ultrafilters V on N x N with t(V) = {U,U). 

Proof. Observe first that, for each natural number n, the set of multiples of n is 

in U. Indeed, as there are only finitely many congruence classes modulo n, any 
ultrafilter must contain one of them, so we can fix j such that < j < n and 
{Ux)x = j (mod n). Then {Ux){Uy) x + y = 2j (mod n), so for idempotent U 
it follows that {lAx)x = 2j (mod n). But then j = 2j (mod n) and so j = as 
claimed. 

In particular, for any n, W-almost all numbers x are divisible by 2" and therefore 
have O's as the last n digits in their binary expansions. 

Using this, we can proceed as in Galvin's and Glazer's proof of Theorem 7 to 
find, in any set C e W, a sequence ho,hi, . . . such that 

(1) All finite sums of distinct /ij's are in C. 

(2) For each z, hi+i is divisible by a power of 2 that is larger than hi. 

Note the following consequence of (2). If a and b are each a sum of distinct /?^'s and 
if no hi occurs in both sums, then the I's in the binary expansions of a and b occur 
in disjoint sets of positions. We define the meshing number m(a, b) to measure the 
amount of intermeshing between these disjoint sets; that is, m(a, b) is the length I 
of the longest sequence si, . . .,si such that for all odd (resp. even) z, there is a 1 
in position Sj of the binary expansion of a (resp. b). It is clear that every integer 
/ > 2 occurs as m(a, b) with a and b sums of different /i^'s; just take the first I of 
the hiS and let a (resp. b) be the sum of the odd (resp. even) numbered ones. 

In view of (1), this means that each of the infinitely many sets m~^{l} meets 
each set of the form C x C ioT C E U. So each m~^{l} supports an ultrafilter 
containing all these C x C and therefore mapping to iU^U) by r. This proves that 
T~^{IA,IA) is infinite. But every infinite closed subset of /3N (or the homeomorphic 
/3(N X N)) has cardinality 22''°; see [13, 18]. □ 

In spite of all these differences between P-points and selective ultrafilters on the 
one hand and recurrent and idempotent ultrafilters on the other, it is possible, using 
the continuum hypothesis (or Martin's axiom) to construct idempotent ultrafilters 
with strong connections to selective ultrafilters. For example, one can arrange that 
an idempotent ultrafilter be mapped to a selective one by the map N — > N that sends 
each natural number a to the position of the rightmost (or the leftmost) 1 in its 
binary expansion. For more information about such matters and for combinatorial 
applications, see [7, 9]. 
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